Let’s describe the population and LICO-AT by DA:
Median number of care destinations spatially accessible (by transit) within 15 mins from a parcel in each DA:
We will focus on DAs that are in high-high and low-low spatially auto-correlated clusters as determined by local Moran’s I.
Spatial autocorrelation refers to the extent to which a variable is correlated with itself through space. Think “everything is related to everything else, but near things are more related than distant things” (Tobler 1970). Positive spatial autocorrelation indicates that observations are very similar in values (i.e., spatially clustered). Negative spatial autocorrelation occurs when observations with dissimilar values are closer together (i.e., spatially dispersed). Moran’s I statistic can be calculated to describe spatial autocorrelation.
For the whole City of Hamilton, we see a positive spatial autocorrelation:
##
## Moran I test under randomisation
##
## data: access_15_parcel_DA$med_copp_isc
## weights: nbw
##
## Moran I statistic standard deviate = 45.692, p-value < 2.2e-16
## alternative hypothesis: greater
## sample estimates:
## Moran I statistic Expectation Variance
## 0.8965694067 -0.0011337868 0.0003860017
Moran’s I statistic is 0.8965694, very close to 1 (i.e., positive spatial autocorrelation), with a p-value of lower than 0.05 indicating significance. We can reject the null hypothesis and conclude there is evidence for positive spatial autocorrelation for the whole region. Positive spatial autocorrelation indicates that neighbours are very similar in values. This can be clusters of high values, and clusters of low values.
But where are the clusters? IN the following we calculate the local Moran’s I – this involves tabulating the spatial moving average for each DA (the average value of all the neighbours around each DA, for each DA). We only include the Hamilton Central area as the rest if Hamilton is for the majority more rural (different) and we need not be concerned with edge effects (NOTE: I also did this analysis for the full City of Hamilton, and if significant, only Hamilton Central contains high-high clusters, rest are low-low).
The closer the points are to the vertical line – the more spatially autocorrelated the dataset is. In the following, we spatially visualise the statistically significant results where the top right quadrant of the plot above are (high-high) and the bottom left quadrant are (low-low) values. We also add in the ward boundaries within Hamilton Central:
How do we interpret the Local Moran’s I statistic? An advantage is it yields a statistical significance value. In the DAs that are shown in the map above, the are either high-high clusters or low-low clusters. A high-high cluster are DAs that have accessibility values that are high and are surrounded by neighbours that are relatively high in a statistically significant way; meaning, that if the values of Hamilton Central were shuffled and randomly assigned to DAs (i.e., a random ’null hypothesis), the chance of these clusters occurring are less than 0.05%. Same goes for the low-low clusters. Moran’s I is a common statistic used in spatial analysis.
However - we have an issue. As can be seen, practically no DAs are high-high within the Mountain (the 4 most southern Wards). This is likely because a 15-minute travel window is quite narrow for transit mode. If 15 minutes is really important for care trips – then our focus on ‘high’ accessibility DA should be restricted to the LOWER Hamilton-Central (and perhaps pick 8 DAs?). If we really want to include ‘high’ accessibility values on the MOUNTAIN (to show diversity of the city’s urban environment), I suggest we do conduct this analysis assuming a larger travel window - such as 30 minutes. I know the literature suggests care trips are often shorter in length (~15 minutes) but some less frequent trips may be longer (i.e., further trips to hospitals, specific grocery stores) and the use of wider travel windows for transit has been done in the literature (in fact, my sense is 30 mins or longer is more common than 15), so this choice may not be necessarily critiqued.
So, assuming a 30 minute travel time window, we see the new low-low
and high-high Local Moran’s I Plot below:
Now let’s plot the accessibility values for only DAs that have statistically significant quadrant 1 (high-high) and quadrant 2 (low-low) values and some DAs that can be potentially selected (black):
The black outlines are the potentially ‘selected’ DAs for audit and they are as follows:
How were they selected? a high-high and low-low cluster for each the LOWER and MOUNTAIN were selected. A DA with the highest accessibility value out of the high-high DAs in LOWER Hamilton-Central was selected along with 3 neighbouring DAs that are also high-high. For low-low, in LOWER Hamilton-Central, the DA with the lowest accessibility value + contains care destinations was selected and 3 neighbouring DAs also with signifcant low-low spatial autocorrelation was selected. This process was repeated to select 4 more high-high DAs and 4 more low-low DAs on the MOUNTAIN-HAMILTON CENTRAL. Accessibility values in high-high are all Q4 (high values) and low-low values are all Q1 (low values).
The following 3 plots showcase these selected DAs. Let’s see the accessibility, population and LICO for just these DAs (NOTE: click on the DA to see the accessibility, population and LICO-AT values:
OSM Street network with the posted vehicle speeds within 400m buffers around destinations per DA:
*CAUTION WHEN USING SPEEDS FROM THIS MAP. I believe what’s displayed is the whole segment average speed. I’m displaying the road network as used by {r5r} and I can only speculate that it averages the road segment speed to increase computational efficiency. However, in real life, the posted speed varies within parts of each segment. So it would be worth double-check speeds that do not end in 0s in the field / google street view. I’m not certain how to display the OSM road network as-is (unaveraged speeds), and it may not be worth my time figuring out at this point. But let me know!
We can also try another “Local Statistics of Spatial Association” (LISA) approach to detect clusters – the \(G^*\) statistic (Getis and Ord, 1992). In this measure, the spatial weights are not row-standardized and are binary. In the Local Moran’s I we calculated the weighted spatial moving average. In this measure, we instead calculate the total of the variable in the neighborhood (i.e., being the value of the DA and the value of all the DAs touching a DA).
We see from the map above, the result is quite similar as the Local Moran’s I map. Why? I think because positive spatial autocorrelation is quite apparent and what we define as a neighbour is quite similar (i.e., a neighbour for each DAs are the DAs that border it). Is this a good way of defining a neighbourhood? Not sure. We don’t have information on what is a good distance away from a DA to be considered within the same neighbourhood (i.e., no surveys of people’s perceptions). So once again, stats are complex and shed a some light but leave other things unknown.